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The demonstration and use of nonlocality, as defined by Bell’s theorem, rely strongly on deal¬ 
ing with non-detection events due to losses and detectors’ inefficiencies. Otherwise, the so-called 
detection loophole could be exploited. The only way to avoid this is to have detection efficiencies 
that are above a certain threshold. We introduce the intermediate assumption of limited detection 
efficiency, that is, in each run of the experiment, the overall detection efficiency is lower bounded by 
rjmin > 0. Hence, in an adversarial scenario, the adversaries have arbitrary large but not full control 
over the inefficiencies. We analyse the set of possible correlations that fulfill Limited Detection 
Locality (LDL) and show that they necessarily satisfy some linear Bell-like inequalities. We prove 
that quantum theory predicts the violation of one of these inequalities for all rimin > 0. Hence, non¬ 
locality can be demonstrated with arbitrarily small limited detection efficiencies. We validate this 
assumption experimentally via a twin-photon implementation in which two users are provided with 
one photon each out of a partially entangled pair. We exploit on each side a passive switch followed 
by two measurement devices with fixed settings. Assuming the switches are not fully controlled 
by an adversary, nor by hypothetical local variables, we reveal the nonlocality of the established 
correlations despite a low overall detection efficiency. 
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Introduction — When studying the discoveries in fun¬ 
damental physics of the past century, one cannot help 
but come across Bell’s seminal work on the nonlocal na¬ 
ture of quantum theory [T]. It implies that quantum 
physics can produce correlations which cannot be ex¬ 
plained by a common past with local variables propagat¬ 
ing contiguously. This has not only proven fascinating 
from a foundational point of view, but also given rise to 
applications in device independent quantum information 
processing (DIQIP) [5], such as quantum key distribu¬ 
tion ma, randomness generation inna, or entanglement 
certification [SllS]. For a semi broad-audience presenta¬ 
tion of these concepts, see [10]. 

Let us briefly recall the concept of local and nonlocal 
correlations. Assume that a source emits pairs of par¬ 
ticles that travel to two distant stations, say Alice and 
Bob. As depicted in FIG. the two experimentalists 
perform one out of several possible measurements on the 
individual particles they receive and record the associ¬ 
ated outcomes. We denote Alice’s and Bob’s measure¬ 
ment choices by x and y and their recorded outcomes by 
a and b, respectively. They can then compute the cor¬ 
relation P{ab\xy). Given the setup, it seems natural to 
think that any correlations that Alice and Bob can ob¬ 
serve in this way are due to particles having a common 
past, as they come from the same source. We refer to this 
common past by A. Correlations that can be explained 
by the existence of such a parameter are called local: 

PL{ab\xy) = J dXp{X)P{a\xX)P{b\yX). (1) 



FIG. 1. Two boxes receive each a particle, emitted by a com¬ 
mon source, each. They are given inputs x and y, which 
we depict here as the setting of an active switch, and return 
outputs a and 6, respectively. There is the possibility for 
non-detection events, in which case the corresponding output 
variable takes the value 0. Since these losses can be seen as 
happening inside the box, they can depend on the inputs x 
and y, respectively. We analyze the limited detection local 
case of this scenario, meaning that a local hidden variable A 
not only fully describes the state of the particle, but can also 
infiuence whether or not a non-detection event occurs. 


Bell’s work shows that there are quantum correlations 
that cannot be reproduced by such a local model, proving 
that quantum physics is inherently nonlocal. Nonlocality 
has since been demonstrated many times |2], and finds 
repercussion in a variety of applications [31 [SHE] . 

However, when demonstrating quantum nonlocality, 
several issues have to be addressed. Here, we are specif¬ 
ically interested in one of them: what happens if the 
particles can be lost on the way to or inside the measure- 
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merit stations, including the possibility that they reach 
the detectors but are simply not registered. In this case, 
we say that a = 0 or b = 0 . One immediate idea is of 
course to carefully analyze why the particles get lost and, 
if the mechanisms are well understood, to simply discard 
these cases. This means that Alice and Bob postselect on 
the cases for which they both register a detection event: 
a ^ 0 and 6^0. Notably, this opens up the possi¬ 
bility that fully local correlations appear nonlocal if our 
understanding of the cause of the non-detection events is 
incorrect, a situation that we wish to avoid [TT 1 IT 2 ]. This 
is especially relevant in the case of active adversaries in 
DIQIP applications. Another option is to consider the 
non-detection events as an additional possible outcome 
and simply check if the resulting correlation is nonlocal. 
In this case, one will never mistake a local correlation 
for a nonlocal one. The drawback, however, is that even 
highly nonlocal distributions may now appear local. 

In the end, the only way address this issue, usually 
called the detection loophole, consists of not only pro¬ 
ducing highly nonlocal correlations but also having a high 
enough detection efficiency [ 5 ]. If the latter is not satis¬ 
fied, even a perfect state preparation and perfectly cal¬ 
ibrated measurement apparatus do not help and one is 
left with an inconclusive experiment unless the detection 
loophole is assumed to be not exploited. It is only re¬ 
cently that all the potential loopholes in a Bell test have 
been closed in a single experiment for the first time m- 
However, closing the detection loophole in long-range ex¬ 
periments and applications remains difficult. Namely, af¬ 
ter a transmission through 10 km of the currently best op¬ 
tical fiber, the losses already exceed the tolerable thresh¬ 
old. We therefore take a different route in this paper. 

Here, we introduce the concept of limited detection 
locality. It consists of an intermediate assumption be¬ 
tween neglecting the detection loophole and closing it 
completely. We show that this assumption, even when ar¬ 
bitrarily weak, allows demonstrating nonlocality by post¬ 
selection with arbitrarily low overall detection efficiency. 
We then connect the concepts of limited detection local¬ 
ity and measurement dependent locality Finally, 

we discuss a dedicated twin-photon experiment that we 
performed, and show how quantum nonlocality can be 
revealed despite low detection efficiencies without having 
to make the fair-sampling assumption. 

Limited Detection Locality (LDL )— Assume that 
there exist a fixed rimin and rjmax with ^ 

[0,1] such that 

Vmin — P{a ^ 0|xA) ^ (2) 

and similarly for Bob. This corresponds to the assump¬ 
tion that, for any input x and any common local variable 
A, there is a probability of at least •qmin and at most rimax 
of having a detection. Consider for example a world in 
which the polarization degree of freedom of photons was 
as of yet undiscovered. It is nowadays well known that 
the detection efficiency of most photonic detectors is in¬ 


deed sensitive to the polarization degree of freedom of 
light. However the detection efficiency never goes up to 
1 or down to 0, which corresponds to our assumption 
of limited detection efficiency with nontrivial and 

Vmax- It is also worth noting that the assumption can be 
imposed at the cost of weakening the established corre¬ 
lations by a partial postselection. We do not discuss this 
further here, but refer to | 16 j for an explanation of this 
idea. We refer to correlations fulfilling conditions 0 and 
0 as limited detection local. Note that technically, the 
case of [rimim Tlmax] = [ 0 , 1 ] Can still be analyzed with our 
model, making it possible to retrieve the results discussed 

in [II]- 

In an experiment, one can additionally determine 
the actual observed detection efficiencies, which may of 
course be different for the different sets of inputs. It is 
necessary that some detections occurred for all possible 
sets of inputs, which reads 

P{a ^ 0|x) = 77^ > 0 . ( 3 ) 

Since 

P(a ^ 0 \x) = j d\p{\)P{a 7^ 0|a:A), ( 4 ) 

we have that rjmin < Vx — Imax- All of this holds 
analogously for Bob’s side. To ease notation, we define 
■nxy = Vxlly- 

We can now focus on the postselected limited detection 
local distributions given by 

P{ab\xy, a 0 ,b ^ 0 ) = _ (5) 

Vxy 

Similarly to local correlations, these postselected lim¬ 
ited detection local correlations fulfill certain conditions. 
More precisely, they form a convex polytope and there¬ 
fore respect a set of linear Bell-like inequalities (T^j . Mak¬ 
ing the additional assumption that ij^y = r]x>yi for all 
x, x', y, y', one of these inequalities is, for example, given 
by 

^Ln^(00|00,« ^ 0,6 ^ 0) 

VminVmax ^^ ^ 6 ^ 0 ) 

10 ; ^ ^ 0 ; 6 ^ 0 ) 

-^L.^(OO|ll,a^0,6^0)<O. (6) 

In an experiment with given losses, the experimental¬ 
ists can check for which values of rjmin and rjmax their 
observed correlations violate this inequality. They can 
then conclude that no limited detection local model with 
these parameters could have reproduced them. 

Interestingly, there are quantum correlations that do 
not fulfill this inequality independently of the observed 
detection efficiencies rj^y and for any upper bound rjmax 
as long as rjmin > 0 . In fact, this is achieved by all quan¬ 
tum correlations violating Hardy’s paradox m and can 
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therefore be realised using any sufficiently pure partially 
entangled 2 -qubit state with the right set of projective 
measurements. This may be quite surprising since it is 
well-known that without the assumption of limited de¬ 
tection efficiency § , a minimal observed detection effi¬ 
ciency of ry = 'Yhxy'^^vl'^ ^ i required to demonstrate 
nonlocality for 2 parties using binary inputs and outputs. 
However, making the arbitrarily weak additional assump¬ 
tion that P[a ^ 0\xX) > rj^in > 0 allows demonstrating 
quantum nonlocality despite arbitrarily large losses and 
detection inefficiencies. 

Link to measurement dependent locality (MDL) — An¬ 
other way to counterfeit nonlocal correlations using only 
local resources is if the common history A is correlated 
with the inputs x and y. If the correlation can be arbi¬ 
trary, then any nonlocal correlation can be counter-fitted 
in this way, so limitations have to be imposed to be able 
to make any conclusions. Together with some co-authors, 
we recently studied the case of measurement dependent 
local correlations HHH] that are defined in the following 
way: 

P{abxy) = j dXp{X)P{xy\X)P{a\xX)P{b\yX) (7) 

£ < P{xy\X) < h. ( 8 ) 

Note that if Alice and Bob each have N inputs, then 0 < 
£ ^ ^ ^ h ^ £ due to the normalization of probability 
distributions. Similarly to this paper, we showed that the 
set of MDL-correlations for fixed £ and h can be analysed 
using Bell-like inequalities. 

It turns out that there exists a strong link between the 
concepts of limited detection locality and measurement 
dependent locality. We make this connection explicit by 
the following theorem, whose demonstration is detailed 
in the Supplemental Material [TB] : 

Theorem: Assume that we have a correlation that can 
be produced by using a combination of postselected lim¬ 
ited detection ([^ and measurement dependent ^ local 
0 resources, with parameters (? 7 mmj Imax) and {£, h), re¬ 
spectively. Then, this correlation can also be reproduced 
using only measurement dependent local resources with 
£' = and h! = 

Intuitively, the link comes from the fact that an adver¬ 
sary that is allowed, via postselection, to discard unsat¬ 
isfying inputs is in fact influencing the choice of inputs. 
A consequence of this theorem is that whenever a cor¬ 
relation cannot be reproduced by a measurement depen¬ 
dent local model with bounds £' and h\ then it can also 
not be realized using limited detection efficiencies with 
’’Pprn > ^ 2 ^/ !?ya 3 L < k' , where N is the number 

of inputs for each of the two parties. This allows us to use 
any result derived for the MDL-scenario and apply them 
to LDL-correlations. Even more interestingly, we are now 
able to deal with the problems of losses and measurement 
dependence in a straightforward way since we can simply 
focus exclusively on measurement dependence. 



FIG. 2. A source emits supposedly entangled photons to¬ 
wards two measurement stations. In each station, a passive 
switch routes the individual incoming photons to one out of 
two pairs of detectors, corresponding to either settings 0 or 
1. The boxes are given inputs x and y, which decide which 
pairs of detectors are switched on, and return outputs a and b 
depending on which detector fires. If the photon goes towards 
the inactive pair of detectors it is simply considered as lost 
and not registered. We compare this scenario to the scenario 
where a local hidden variable A that can influence the losses 
and settings try to emulate our observed correlations. 


Experiment: We now discuss the twin-photon based 
experimental realization whose principle is depicted in 
FIG.[^ Each user’s station comprises 2 measurement de¬ 
vices with fixed settings, i.e., 4 single-photon detectors 
(8 in total). The detection events at Alice’s are space¬ 
like separated from the corresponding ones at Bob’s. In 
order to emulate active switches between different ana- 
lyzis settings at each user’s location without being lim¬ 
ited by driving speeds, we exploit instead a simple and 
elegant solution for passive implementation, referred to 
as Zbinden switches [2]. First, standard 50/50 beam¬ 
splitters connect the incoming photons to the measure¬ 
ment devices. Then, two quantum random number gen¬ 
erators (QRNG) provide the inputs x for Alice and y for 
Bob to randomly select one pair of detectors associated 
with one measurement. The other two detectors of each 
box remain inactive. Gonsequently, when, in any run 
of the experiment, a photon hits an inactive detector, it 
is merely disregarded. Hence, with perfect devices, the 
maximum attainable detection efficiency is 50%. 

Our source produces partially entangled photon pairs 
close to the state 
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(9) 


and the 8 detectors are set up to perform mea¬ 
surements close to the ideal projective measurements 
\Ao{0)) = cos0|O)+sin0|l), |Ai(7) =| Ao(0 - f)), 
|Ho( 0 )) = |Ao(- 7 ) and |Hi( 0 )) = |Ai(- 7 ), with 
+ Further experimental details are 

given in [T] as well as in the Supplemental Material m- 
Without any additional noise, the resulting correlations 
would violate inequality (|^, 'ir\min > 0 and 'irjmax- Ob¬ 
viously, due to noise and imperfections even in the postse¬ 
lected case this is not fully realized. Our postselected cor- 


9 = arccos 
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Bob 




y=0 


1 


X cs b = 0 1 

o 0.01977(12) 0.01003(9) 
^ 0.00830(8) 0.19763(36) 
^ o 0.02536(15) 0.00325(5) 
^ 0.00106(3) 0.23631(41) 


0 1 
0.00435(6) 0.00112(3) 
0.01576(3) 0.24282(41) 
0.00089(3) 0.03583(19) 
0.01411(12) 0.18349(39) 


TABLE I. Measured unconditional probabilities P{abxy), in¬ 
cluding the distribution of the inputs. 


relations, meaning that we discard all the non-detection 
events, can be found in Table |Tj While we did not pre¬ 
cisely determine the overall detection efficiency in our 
experiment, it is worth to note that it stands clearly be¬ 
low the required | efficiency that is necessary to close the 
detection loophole in the case of 2 parties with binary in¬ 
puts and outputs HIS]. However, assuming that the 
losses are the same for all input pairs, i.e., rixy = Tj^’y', 
we can still conclude that we reveal quantum nonlocality 
for > 0.267. This follows from inequality 

There are two ways to illustrate this result. First, as¬ 
sume that a local hidden variable guides, on each side, 
the particle through the beam-splitter and influences the 
detection efficiency. Such a local hidden variable would 
need to have the power to reduce the overall detection 
efficiency down to below 26.7% in order to mimic non¬ 
local correlations. Next, consider an adversary partially 
controlling the detectors’ efficiencies. Assuming standard 
50/50 beam-splitters, we have that rimax < 0.5 and one 
gets rimin = 0.134, i.e., such an adversary would have 
to lower the detector efficiencies corresponding to set¬ 
tings they do not like down to below 13.8% in order to 
have local correlations appear nonlocal. In our experi¬ 
ment, due to the beam-splitters, the losses in the fibres 
and the difficulties of perfectly aligning 8 detectors, we 
can safely consider that ijmax <0.1, leading to a required 
Vmin < 0.027 in order to reproduce the correlations using 
a local hidden variable model. 

Additionally, the way our experiment was built and 
carried out clearly shows the link between limited detec¬ 
tion efficiency and measurement dependence. At both 
Alice’s and Bob’s stations, the respective beam-splitter 
sends the photon to either the detectors corresponding 
to input 0 or to the ones corresponding to input 1. The 
inputs X and y determine which pair of detectors we con¬ 
sider as switched on each side. If the photon goes towards 
the detectors for input 1 while we input 0, then the de¬ 
tection is not registered and the photon is lost. The non¬ 
detection events thereby allow the photons to ’choose’ the 
input they will reply to. The limited detection assump¬ 
tion then corresponds to limiting this choice, i.e., while 
the action of the beam-splitter can be biased, it cannot 
be fully biased. The link to measurement dependence is 
here obvious. 

We can use theorem 1 to extend our analysis to the 
case where we consider our random number generators 


providing the inputs x and y to be biased. Let us assume 
that there is some measurement dependence as defined in 
([^, given by parameters {£, h), as well as some limited de¬ 
tection efficiency, given by parameters {r]mini Vmax) ■ Us¬ 
ing an MDL inequality provided in m and theorem 1, 
we then conclude that our experiment shows quantum 
nonlocality for 


£ 

h 


> 0.15529. 

Vmax / 


( 10 ) 


Conclusion — Losses and detection inefficiencies have 
been serious obstacles in all experiments dedicated to test 
Bell’s theorem. This is why a loop-hole free Bell-test 
took so long to be successfully demonstrated. They are 
also part of the main weak points that an adversary may 
attack in any task whose security relies on quantum non¬ 
locality. To help dealing with both of these issues from 
a theoretical point of view, we have introduced the addi¬ 
tional assumption of limited detection efficiency ([^ . The 
assumption is that the inefficiencies in the setup are only 
partially exploited, an idea that we consider very intu¬ 
itive in itself especially in the case of a fundamental Bell 
test, provided Nature is non-malicious. The case of an 
adversary is also interesting. If one assumes an adversary 
with full control over Alice’s and/or Bob’s devices, then 
one needs to resort to full Device-Independent Quantum 
Information Processing (DIQIP) HHj- However, semi- 
DIQIP is extremely appealing and timely as it allows one 
to lower the extremely high requirement of full DIQIP. 
For example, considering DIQIP as a way to check the 
proper functioning of our system, i.e., going back to the 
original idea of self-testing EDI, it is very natural to as¬ 
sume that possible defects do not set rjmin = 0. In such a 
case, our approach provides much simpler self-tests. Even 
against an active adversary, there are cases in which it is 
reasonable to assume that they have only partial control 
on the detection efficiencies. 

Making the fair-sampling assumption when it is not 
warranted can lead to disastrously wrong conclusions |I21 
[2T1 [22] . The same holds for the LDL assumption in cases 
where it is not justified. However, due to the fact the 
LDL assumption is strictly weaker than assuming fair¬ 
sampling, it can clearly hold and be justified more often. 
In other words, if an adversary is not able to fully sup¬ 
press the detection events, i.e., rimin > 0, nonlocality can 
be demonstrated and exploited. 

We connected the ideas of limited detection locality 
and measurement dependent locality. We showed that 
results from studying measurement dependent local cor¬ 
relations can be applied to the case of limited detection 
locality. Moreover, it is possible to deal with detection 
efficiencies and lack of measurement independence at the 
same time, which we hope will be of use for future Bell 
experiments. 

Finally, we carried out an experiment to bring these 
theoretical ideas to the real world. Our twin-photon 
based demonstration, having detection efficiencies too 
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low to close the detection loophole [T3], remains conclu¬ 
sive as long as we assume a limited detection efhciency 
such that > 0.267. It furthermore reveals very 

clearly the link between measurement dependence and 
limited detection. We believe that the methods depicted 
here will be of use to any frontier experiments or applica¬ 
tions that do not manage to close the detection loophole 
completely. 

Note added — During the review process, we became 
aware, in addition to Ref. |13j , of two more experimental 
demonstrations related to loophole-free Bell tests [251l24j . 
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Polytopal structure of Limited Detection Local correlations 


Consider the case where N parties perform a nonlocality experiment. The input and outcome of the i-th party 
will be denoted by Xi and Ai respectively. We consider the case where non-detection events can occur, they will 
be denoted by Ai taking the value 0 . We will denote by A[ the outcome of party i after postselecting on having a 
detection. As discussed in the main text, we make the assumption of limited detection locality and show that these 
correlations form a polytope. The theorem stated here is more general than needed for the main text, where we only 
consider the case of 2 parties. 

Definitions: Let {Ai}^^, {WliLi be sets of random variables with alphabets {l---mi, 0 }, 

and respectively, mi,ni G M. In the following, the corresponding lower case letters will denote values in 

the respective alphabet. We will denote probability distributions over a random variable V by Py, the value of this 
distribution for a given value of V by Pv{v). For ease of notation, we will often omit the random variable and just 
write P{v). We will denote conditional probability distributions over a random variable V conditioned on a random 
variable W by Pv\w- In the case of continuous random variable we denote the probability density by pv- In the 
following we assume that all the probability distributions are well defined. The set of all probability distributions 
over V will be denoted by Vy and of all conditional probability distributions over V conditioned on W by Vy\w 

We define the following sets: 

• The sets of 1-party distributions with limited detection: 

PDiijimin^ Pmax) — ^ Ai\Xi • I Pmin ^ PAi\Xi — I Pmax^ ^ € {0 ' ' * ^ 


• The set of Wparty limited detection local distributions: 

CDCN(,\j]min,i\i=\i {Pmax ,i} i~l) = ^PAx...An\Xi...Xm Ai ... Ajv |Xi.. .Xjv ’ . 


P{ai ...gnIxi... xn) = j dA/ 9 (A) P{ai\xi\), 

i—1 

Pa i\XiK=\ € PD i {,Pmin,i 5 Vz ^ 


• The set of Wparty postselected limited detection local distributions: 


C'DCVSN{{V7nin,i}f=y{'nniax,i }f=l,{Vxi...XN}xi...XM) - \J^A'^...A'j^\Xi...Xn € PA'^...A'^\Xi...Xn ' 

e LVL{{'qrmn,i}i=lA'n 
Q{(ll ^ 0 . . . (ZjV 7^ 0\xi ...Xjv) — Vxi 


P{a[ .. .a^jvlxi.. .xat) 


Q{a[ .. .g^jvlxi. ..xn) 
Vxi...Xn 


} 


We further define the following two sets, which we will prove to be the vertices of CDfip^nimilmax) and 

V (j]rnin-i V^nax^ — Ai\Xi ■ ^ {1 ■ ■ ■ G {1 . . . TTli^ and T]x G ^'f]mini VrrLax} S.t. 

V{ax\x) = rjxj V{0\x) = 1 — r]x and otherwise V{a\x) = o| 


V ^ {Vmax,i}i—i) — Ai...An\Xi...Xn ’ 

G V ^ 'Hmax^i) S.t. 


N 

V{ai ...gnIxi. ..Xn) = 

2 = 1 
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With these definitions, we can now state the theorem. It refers to polytopes, which for the purposes of this work 
are simply seen as a convex structure with a finite set of vertices. Equivalently they can be defined by a finite set of 
inequalities. 

Theorem: For fixed N, {r]min,i}iLi and {vTuaxAiLi, {VmaxAiLi) is a polytope 

whose vertices are a subset of ({? 7 min.Jili, {Wx.Jili)- Furthermore, for fixed {rixi...xAxi...x,^), 

1 5 1; ...a:iv ) iS alsO a polytOpe. 

Proof: To ease notation we will omit writing N, {r]min,i}iLi and {'r]max,i}iLi from now on. The first part of the 
theorem follows from the following two lemmas. 

Lemma 1: CT>i is a polytope with vertices given by 

Proof: Due to the normalisation of probability distributions, i.e., 

^ ^ 7^(uzl^z) — 1 Vxj, 

CLi 

we have that P{0\xi) = 1 — and we can therefore work in the lowerdimensional subspace 

given by Oi € {l...TOi}. In this subspace, we are then left with the polytope defined by the inequalities 
'Hmin < PiO'Axi) < rjmax- This is the definition of a hypercube whose vertices are defined by the corresponding 

part of V^®‘. 

The Lemma follows. □ 

Lemma 2 : Let Q and TZ be polytopes with vertices Vq and Vn respectively. Let 5 = |s' : 3A s.t. S{u,v) = 

f dAp(A)QA(u)R^(v) with Q G Q,R€ 7^|. Let V 5 = |Vs : Vsiu,v) = Vq{u)Vr{v) with Vq G Ve,yR G Vn^. 

Then 5 is a polytope whose vertices are a subset of V 5 . 

Proof: By definition, S is convex and V 5 G S. 

Let S G S, then by definition we have: 

S{u,v)= J dAp{A)Qx{u)R\{v) 

= J dAp{A)J2qx^^Vf^{u)J2rx,jVf^{v) 

'i 3 

= J2{ I dAp(A)gA.*rAj)vQ(M)V^,(u) 

(b) 

= ^s,jVg\u,v) 

(b) 

In the first step we use the fact that any element of Q and TZ can be written as a convex combination of their vertices 
and we define q\^i > 0, ^iq\,i = 1 and r\j > 0, = I. In the last step we defined = f dAp{A)q\^irxj, 

which fulfils Sij > 0 and Sij = I and also used the definition of Vg . 

This proves the Lemma.□ 

Using these two Lemmas in conjunction (and using Lemma 2 iteratively) proves that CDC is a polytope and that 
VcPP contains its vertices. 

To finalize the proof of the theorem we need to show that CDCPS is a polytope as well. This can be seen directly 
since the set is obtained by slicing CDC with the hyperplanes defined by P{ai ^ 0 ... 0 |xi .. .xn) = Pxi...xN ■ 
Cutting a polytope with hyperplanes results in another polytope. The final step is a simple rescaling of the entries 
(equivalent to rescaling the axes) and therefore the set remains again a polytope. 


This proves the theorem. □ 



A more general method of dealing with detection inefficiencies- 


It is possible to impose any desired rjmin at the price of adding some noise to the system. Assume that Alice and 
Bob set their detection systems, which we assume to have an efficiency rj, such that any time a non-detection event 
occurs, the system still gives an outcome with the probability rjmin- la this way, Alice and Bob impose that their 
detection systems have a limited detection efficiency given by the chosen rjmin and rjmax = 1 and they can treat the 
resulting correlations by the tools presented above. This however comes at the price of adding local noise to their 
correlations. In fact, assume that Alice and Bob would share the nonlocal correlation Pml if the detectors were 
perfect and there were no losses {e.g., given by projective measurements on a pure quantum state) and denote by 
and P^i^ the marginal distributions of Alice and Bob, respectively. In the non-detection cases, the detection 
systems are set up such that they give with probability rjmin an outcome given by the local distributions P^ and , 
respectively. Then, by postselecting on the cases where the detection systems gave an outcome, Alice and Bob share 
the correlation: 


P = 


(ji'^PnL + ??(I — 'll)Vmin{PNLPL + 


PtPNL) 


+ (^-^)'^tninPLPE I X 2 - 

{V “I” (1 


( 11 ) 


They can then analyze this correlation using the tools of limited detection efficiency presented above. 

One possibility of dealing with losses and detector inefficiencies is to assign the non-detection events to an additional 
outcome and treat the resulting correlations using the usual tools of nonlocality. This corresponds exactly to the 
strategy we just presented with rjmin = 1- However, our approach is more general, allowing to assign only a fraction of 
the non-detection events to an outcome and postselecting on the rest. For a fixed detection efficiency rj, our method 
therefore encompasses both of the previous strategies, full postselection and full assignment to an additional outcome, 
and additionally allows for an arbitrary mixture of the two. It is at this point not obvious to us that for a given 
experiment (meaning for a given Pjvl and a given 77 ), all of these strategies would yield the same result. We leave it 
up for future works to analyze this question in more detail. 


Limited Detection Locality and Measurement Dependent Locality 

In this section we prove the fink between limited detection local and measurement dependent local distributions. 
This can be proven more generally, here we only present the 2-party version. 

Definitions: We introduce the random variables Da and Db with alphabet {0,1} such that Da = 0 if and only if 
A = 0. We define the set of limited detection local distributions allowing for measurement dependence: 

MVCDC{e,h, rjmin, Vniax) = {PadaBDbXY --PiadAbdBxy) = J dXp{X)P{xy\X)P{adAbdB\xyX), 

Vfnin ^ PoADBlXYAi^MxyX) < 

^max j 

P{adAbdB\xyX) = P{adA\xX)P{bdB\yX) 

I < P{xy\X) < h, 

I dAp(A) = l,p(A)>0} 

We also define the set of measurement dependent local correlations: 

MT)L{h,t) = {Pabxy ■■P{abxy) = J dXp{X)P{xy\X)P{ab\xyX), 

P{ab\xyX) = P{a\xX)P{b\yX) 
i < P{xy\X) < h, 

J dAp(A) = l,p(A)>0} 

Theorem: If 


PaDaBDbXY € TT)^ h^ Tfruin •!‘^rnax) 
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then 


Pabxy\Da=i,Db=i 


e MvcC^t, 

Vmax 


^max 

Vmin 


h). 


Proof: We have 

1 . P{adAbdBxy) = f dAp(A)P(xylA)P(adAbdBlxyA) 

2. P(adAbdBlxyA) = P(adAlxA)P(bdBlyA) 


3 - f]min < PoADBlXYAi^MxyA) < ^max 

A. (. < P{xy\A) < h. 

Let us prove a few implications: 

• If {ADa\X) and {BDb\Y) are local, meaning that they fulfil condition 2, then {Da\X) and {Db\Y) are also 
local: 


P{dAdB\xyA) = E P{adAbdB\xyA) 

a,b 

= ^ P{adA\xA)P{bdB\yA) 

ab 

= P{dA\xA)P{dB\yA). 


• If {ADa\X) and {BDb\Y) are local, then {A\DaX) and {B\DbY) are also local: 


P{ab\xdAydBA) 


P{adAbdB\xyA) 
P{dAdB\xyA) 
P{adA\xX) P{bdB\yA) 
P{dA\xA) P{dB\yA) 
P{a\xdAA)P{b\ydBA). 


• Knowing less cannot result in knowing more, meaning that upper and lower bounds on P(/i|i/(T) also hold for 
P{fj,\i'): Assume < h, then 


a 

< h^P{a) 

<7 

= h 

where we used that ^^P{<x) = I- The same holds for lower bounds £ < P{y\i'a). Due to this, condition 3 
implies 


Pmin Xi PoADBlAi^M^) < Vr, 


Using the implications above, we can show that the conditions imply bounds on P{xy\DA = A,Db = 1, A): 


P{xy\DA = 1 ,Db = 1,A) = 


P{DA = l,DB = l\xyA) 


P{Da = 1,Db = 1\A) ^ 


P{xy\A) 






< Pixy\DA = 1 ,Db = 1, A) < 


Vmax 


Vmin 
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FIG. 3. Experimental setup - SL: spherical lenses; HWP: half-wave plate; BS: birefringent crystal; CL: cylindrical lens; BBO: 
beta-barium-borate crystals; UVF: ultra-violet filter; CC: compensation crystal; L: lens; PC: polarization controller; BS: 50/50 
beam-splitter; WP: Wollaston prism; D: single photon detector; QRNG: Quantum random number generator; Daq&P: Data 
acquisition & processing. 


We can now prove the theorem: 

P{abxy\DA = 1,Db ^ 1) = j dXp{\\D a = 1,Db = l)P{xyWDA = 1,Db = X)P{ab\xy\DA = 1,Db = 1) 




— — Vn 


= J dXp{\\DA = 1,Db = 1) P{xy\X, Da = 1,Db = 1) 
P{a\xX,DA = l)P{b\yX,DB = l) 


This is by definition an MDL-correlation: 

P{ABXY\Da = 1,Db = 1)€ 

^max ^min 


Experimental details 

To produce the twin-photon partially polarisation entangled state, we employ the source depicted in the top-left 
hand side of FIG.[^ A pump laser at 404 nm is sent to two cascaded type-I BBO crystals to produce photon pairs 
at 808 nm in a coherent superposition of two orthogonal polarisations via the process of spontaneous parametric 
down-conversion. To generate the twin-photon state of eq. (10) in the maintext, we set the desired weights of the 
coherent superposition and the phase between \VV) and \HH) by placing a half wave-plate and a tilted birefringent 
crystal in the path of the pump laser. The generation process introduces some distinguishability between the two 
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components of the state which is erased by an adapted compensation crystal. Moreover, to remove the discernibility 
in terms of spatial modes (k-vectors), the paired photons are coupled, each, into a single mode fibre. Thanks to this 
configuration, we can produce the desired state with a fidelity of 99% (see also Ref. [1] for more details). 

To perform the measurements, Alice and Bob have each two polarization analysers with fixed settings, defining two 
bases, {Ao,Ai} and {Bo,Bi}, respectively. These analysers are made of a half wave-plate and a Wollaston prism. 
Each prism of the setup has an extinction ratio of 6 OD. They are connected to the source via a 50/50 beam-splitter. 
To choose the measurement basis at both locations, two independent random bit sequences are utilized. This way, only 
the two randomly selected outputs associated with the given basis are considered, the other two being disregarded [2] . 
This configuration therefore requires the use of 8 single photon detectors, being all Silicon avalanche photodiodes. 
Alice employs 4 Excelitas SPCM-AQR-16-FC featuring ^60% detection efficiency and dark count rates of ^100 Hz, 
while Bob has 4 QuTools featuring ~50% detection efficiency and dark count levels of ~2000 Hz. Note that the 
detectors’ configuration is chosen in such a way that it can satisfy, as much as possible, the assumption = rjx'^y' 
for all X, y, x' , and y' with an overall efficiency of about 5% (fibre coupling and propagation losses included). The 
detection arrival times of all the detection events are recorded by a time-to-digital converter (not shown) and the 
random choice of the basis is applied during the data post-processing thanks to two different quantum random number 
generators. 
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